Semantic Annotation and Lexico-Syntactic Paraphrase
نویسندگان
چکیده
The IAMTC project (Interlingual Annotation of Multilingual Translation Corpora) is developing an interlingual representation framework for annotation of parallel corpora (English paired with Arabic, French, Hindi, Japanese, Korean, and Spanish) with deep-semantic representations. In particular, we are investigating meaning equivalent paraphrases involving conversives and non-literal language use, as well as extended paraphrases involving syntax, lexicon, and grammatical features. The interlingua representation has three levels of depth. Each level is characterized by the types of meaning equivalent paraphrases that receive identical representations at that level.
منابع مشابه
Italian Treebank lexico semantic annotation and reference lexical resource
The paper reports on the lexico semantic annotation level of the Italian Treebank the rst Italian corpus with a multi level anno tation morpho syntactic syntactic and lexico semantic The strategy of annotation and the reference lexical resource are described and the results achieved too
متن کاملParaphrase Identification Using Weighted Dependencies and Word Semantics
We present in this article a novel approach to the task of paraphrase identification. The proposed approach quantifies both the similarity and dissimilarity between two sentences. The similarity and dissimilarity is assessed based on lexico-semantic information, i.e., word semantics, and syntactic information in the form of dependencies, which are explicit syntactic relations between words in a...
متن کاملExtraction of Lexico-Syntactic Information and Acquisition of Causality Schemas for Text Annotation
We present the INSYSE method for the annotation of texts, based on extraction of semantic relations from syntactic structures. We apply this method to a corpus of 5000 Medline abstracts about central nervous system diseases and gene interactions. Our cooperative approach focuses on (1) extracting lexico-syntactic information from sentences in the corpus comprising causation lexemes and (2) elab...
متن کاملThe Copenhagen Dependency Treebank (CDT) Extending syntactic annotation to morphology and semantics
This paper has two main objectives. The first is to provide an overview of the CDT annotation design with special emphasis on the modeling of the interface between syntactic and morphological structure. Against this background, the second objective is to explain the basic fundamentals of how CDT is marked-up with semantic relations in accordance with the dependency principles governing the anno...
متن کاملAutomatic Generation of Interactive Cooking Video with Semantic Annotation
Videos are one of the most frequently used forms of multimedia resources. People want to interact with videos to find a specific part or to obtain relevant information. To support user interactions, current videos should be transformed to interactive videos. This paper proposes an interactive cooking video system to generate automatically interactive cooking videos. To do this, the proposed sys...
متن کامل